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ABSTRACT 

Pupil-mapping is a technique whereby a uniformly-illuminated input pupil, such 
as from starlight, can be mapped into a non-uniformly illuminated exit pupil, such that 
the image formed from this pupil will have suppressed sidelobes, many orders of mag- 
nitude weaker than classical Airy ring intensities. Pupil mapping is therefore a can- 
didate technique for coronagraphic imaging of extrasolar planets around nearby stars. 
The pupil mapping technique is lossless, and preserves the full angular resolution of 
the collecting telescope, so it could possibly give the highest signal-to-noise ratio of 
any proposed single-telescope system for detecting extrasolar planets. A planet fainter 
than 1CT 10 times its parent star, and as close as about 2\/ D should be detectable. We 
derive the 2-dimensional equations of pupil mapping for both 2-mirror and 2-lens sys- 
tems. We give examples for both cases. We derive analytical estimates of aberration 
in a 2-mirror system, and show that the aberrations are essentially corrected with an 
added reversed set of mirrors. 

Subject headings: Extrasolar planets, coronagraphy, point spread function, pupil map- 
ping, apodization 

1. Introduction 



To successfully detect and characterize extrasolar terrestrial planets around nearby stars, it 
necessary to isolate the light of a planet from that of the parent star to better than about 1CT 10 
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at visible wavelengths or 1CT 6 at thermal infrared wavelengths (DesMarais et al. (2002)). Two 
general types of space-based instruments have been proposed to do this, visible coronagraphs and 
infrared interferometers. 

At present, NASA plans to launch the Terrestrial Planet Finder-Coronagraph (TPF-C) in 2014, 
and the Terrestrial Planet Finder-Interferometer (TPF-I) in 2019. ESA plans to launch an infrared 
interferometer, DARWIN, around 2015. Each of these observatories appears to be feasible using 
current or expected technology, and for each there are several alternative architectures now under 
study. The present paper discusses the underlying mathematical principle of the coronagraph con- 
cept known as pupil mapping (Traub and Vanderbei (2003)), alternatively called intrinsic apodiza- 
tion (Goncharov et al. (2002)) or phase-induced amplitude apodization (PIAA) (Guyon (2003), 
Galicher et al. (2004)). 

The basic idea of pupil mapping is that the uniform intensity of starlight falling on the input 
pupil of a telescope can be mapped, ray by ray, to a non-uniform intensity in an exit pupil, such 
that the image of a star will be highly concentrated with minimal sidelobes. The goal is to reduce 
the sidelobes to less than 1CT 10 within a very few diffraction widths from the central star. This 
permits us to separate starlight from planet-light. For perspective, sidelobes less than 1CT 10 are 
roughly 8 orders of magnitude lower than the Airy-ring pattern that would be expected from an 
idealized conventional telescope image. 

Several other coronagraphic methods have been suggested. One of the first was the idea of 
a square apodized pupil (Nisenson and Papaliolios (2001)), in which the transmission function of 
the pupil is tapered to zero at the edges, thereby reducing the sidelobes, but at a loss of light and 
angular resolution. Another idea is the shaped pupil (Spergel (2000), Vanderbei et al. (2003a,b,c, 
2004)), in which the pupil is covered by an opaque mask that has carefully-shaped transmitting 
cut-outs, thereby strongly reducing the sidelobes in two azimuthally-opposite segments, but again 
at a loss of light as well as some loss of angular resolution. A different approach was taken by 
Kuchner and Traub (2002), Kuchner and Spergel (2003), and Kuchner et al. (2004), who proposed 
a family of image-plane band-limited masks that would block starlight and transmit planet light, 
with little loss of planet light and nearly full angular resolution. Yet another idea was suggested by 
Levine et al. (2003) who combine pupil- shearing with single-mode fibers, a combination that also 
potentially has good transmission and angular resolution. Other methods have also been proposed, 
some of which are more limited in spectral range; see, for example, Aime and Soummer (2003). 

Even before Guyon first suggested pupil mapping for TPF-C, there was an abundant literature 
on the topic of beam-shaping, in both the radio astronomy community (in order to optimally cou- 
ple a telescope beam into a detector horn), and in the laser community (to reshape the Gaussian 
beam from a laser into a more generally useful uniform-intensity beam). The laser beam shaping 
is closely related to the present pupil mapping for astronomy, because the laser beam work par- 
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ticularly aimed at maintaining a flat wavefront after the shaping optics. In particular we mention 
the U.S. patent awarded to Kreuzer (1969), and a more recent, but representative, application of 
this work by Hoffnagle and Jefferson (2003). In fact, one of the equations that we derive in the 
present paper (equations (53) and (54) in Theorem 2), for the case of pupil mapping with lenses, 
is identical to that derived by Kreuzer, the only difference being that the latter was trying to make 
a uniform-intensity beam from a Gaussian one, and we are trying to do essentially the reverse of 
that. 

The present paper is an extension of our previous one (Traub and Vanderbei (2003)), in that we 
move from an idealized 1-D treatment to the more physical 2-D case, in addition to which we now 
also present equations for pupil mapping with lenses as well as mirrors. We furthermore include 
an analytical development of off-axis aberrations, and a method for removing the aberrations of 
off-axis images. We give several illustrative examples of pupil mapping, including a section show- 
ing how one simple type of pupil-mapping can directly yield the entire family of Cassegrain and 
Gregorian paraboloid-based telescopes, including the simple plane-mirror periscope as a special 
case. 

Our approach in this paper is the same as in Traub and Vanderbei (2003), namely to develop 
the analytical basis of pupil mapping, and explore the consequences. For some questions, including 
that of fabrication, it will be necessary to carry out numerical investigations, which we defer to a 
future paper. 

The organization of the present paper is as follows. In Section 2, we define the ray-by-ray 
mapping function R. In Section 3, we derive the differential equations describing the mirror sur- 
faces in terms of R. In Section 4, we show how R can be calculated from the function A 2 , where 
A 2 is the point-by-point ratio of output to input intensity across the pupil. In Section 5, we give 
three examples of pupil-mapping, for constant, Gaussian, and prolate-spheroidal-like functions. In 
Section 6, we extend the theory to include on-axis mirror systems. In Section 7, we extend the the- 
ory to 2-lens systems. In Section 8, we show how to modify the equations to apply to elliptically 
shaped pupils. In Section 9, we estimate the magnitude of off-axis aberrations in 2-mirror systems 
and show that the aberrations are nearly eliminated with a pair of identical but reversed mirrors. 



2. Pupil Mapping 

Consider a 2-mirror optical pupil-mapping system endowed with a Cartesian coordinate sys- 
tem in which the z-axis corresponds to the optical axis so that the pupil and image planes are 
parallel to the (x, y)-plane. The first mirror's projection onto the (x, y)-plane is a circle of radius a 
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centered at (0, 0): 

M 1 = {(x,y):x 2 + y 2 <a 2 }. (1) 

The second mirror has radius a and is shifted along the rr-axis by a distance 5. Its projection on the 
(x, y) -plane is 

M 2 = {(x,y):(x-5) 2 + y 2 <~a 2 }. (2) 
The displacement 5 can be any real value, but if the mirrors are nonoverlapping then we need 

\5\>a + h. (3) 

If 5 = 0, then the optics are concentric. We show two such examples; for mirrors in Section 6.2, 
and for lenses in Section 7. We introduce polar coordinates r and 9 for Mi, and f and 9 for M.2- 
Hence, 

x = r cos 9, y = r sin 9 (4) 

x = f cos 9 + 5, y — fsva.9. (5) 

Note that r = and f = refer to physically different locations (the centers of M\ and M.2, 
respectively), but x = and x = are the same physical location. The mirror surfaces are at 
z = h(x, y) for the first mirror and z = h(x, y) for the second mirror. Light enters the system from 
above, reflects upward off from the first mirror and then impinges on the second mirror, which 
reflects it back downward. 

A pupil mapping is determined by specifying a one-to-one and onto mapping between the two 
mirrors, or equivalently between their two projections M.\ and M. 2 . In general, such mappings 
could be rather elaborate. To keep the design (and analysis) simple, we assume that polar angle 9 
on the first mirror maps to the same polar angle on the second one. Hence, the pupil mapping is 
completely determined by giving a function R from mirror-one radii to mirror-two radii. Hence, 
we have 

r = R(r). (6) 

To be one-to-one, R must map [0, a] monotonically onto [0, a]. In such a case, there is an inverse 
function R: 

r = R(f). (7) 
The fact that these functions are inverse to each other is expressed as 



R(R(r)) = r, R(R(r)) = r. 



(8) 
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3. Mirror Shapes 



Let I and R denote the unit incidence and unit reflection vectors at a mirror surface. Let N 
denote a vector normal to a point on a mirror surface. At point (x, y, h(x, y)) on the first mirror, 
these vectors are 



(9) 
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where h = h(x, y), h = h(x, y), subscripts x and y denote partial differentiation with respect to 
the indicated variable, i.e. 

dh(x,y) 



h x (x,y) = 



dx 



and 



S(x, y, x, y) = a/ (x - x) 2 + (y - y) 2 + (h - hf 



(10) 
(11) 



denotes the distance from point (x, y, h) on the first mirror to the corresponding point (x, y, h) on 
the second mirror. The shape of the first mirror is determined by requiring that /, N, and R are 
coplanar and that angle of incidence equals angle of reflection. In other words, it is required that 
I x N = —R x N. Computing these cross products, we get 



I x N = 



hy 

-h x 




Rx N 



y-y + h y {h - h) 

x — x — h x (h — h) 

h x (y-y) -hy(x-x) 



l 



(12) 



where S is shorthand for S(x, y, x, y). Equating the first two components, we can easily solve for 



h x and h y : 



h x = 



x — x 



S + h-h' 



y-y 



s + h-h 



(13) 



Because the light after reflecting off from the second mirror should be traveling in the z 
direction once again, it follows from simple geometry that 



h x = h x and hy = h y . 



(14) 



As in our earlier paper Traub and Vanderbei (2003), we have the following important lemma 
that tells us that the optical path length for an on-axis source through the system is constant. 



Lemma 1 P = S + h — his a constant. 
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Proof. The quantity of interest, S + h — h, is a shorthand for S(x, y, x, y) + h(x, y) — h(x, y). 
Although it appears that there are four independent variables, there are in fact only two since 
variables x and y can be considered to be functions of x and y (or vice versa). Hence, we need 
only show that the derivatives with respect to x and y vanish. Using (14), it is easy to check that 

— (S(x, y, x, y) + h(x, y) - h(x, y)J = G(x, y) ( 1 - — J - K(x, y) — , (15) 



where 



and 



G(i,y) = i - X + f l - h K h, (16) 

K(x,v) = 9 - V + f 1> - h) +h,. (17) 

Putting the two right-hand terms over a common denominator (S) and then substituting the expres- 
sions in (13) for h x and h y , it is easy to see that both G and K vanish. Hence, the derivative with 
respect to x vanishes. That the derivative with respect to y also vanishes is shown in precisely the 
same way. □ 



An important consequence of this Lemma is a simple decoupling of the differential equations 
for the mirror shapes. 

Theorem 1 The shape of the first mirror is determined by the following pair of ordinary differen- 
tial equations: 

i x-x y-y 

K = h y = — — , (18) 

-TO -ro 

where x and y are known functions of x and y. Similarly, the shape of the second mirror is 
determined by: 

h = h = (19) 

-TO -TO 

where x and y are known functions ofx and y. 



Proof. The equations are an immediate consequence of Lemma 1, (13), and (14). The "known" 
formulae relating x and y to x and y are easy to compute from the functions R and R. For example, 



f R(r) R(\/x 2 + v 2 ) 
x = rcos6 + 5 = -rcos9 + 5 = -^x + 5 = K \ Z JUx + 5. (20) 

r r ^x 2 + y 2 
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The other relations are derived in a similar manner. □ 



We have two approaches that can be followed in solving for the mirror shapes. First, we 
can solve each of equations (18) and (19) separately; an advantage of this, for the case when the 
differential equations must be solved numerically, is that we can choose the x or x coordinates 
independently, say with equal step sizes. Second, we can solve either (18) or (19), analytically or 
numerically, then find the other shape algebraically, as follows. From Lemma 1 we can solve for 
S and then square it to see that 

S 2 = (p - h + h)j 2 = P 2 - 2P (h -h) + (h- h) 2 . (21) 

Substituting S 2 = (x — x) 2 + (y — y) 2 + (h — h) 2 , we see that the quadratic terms involving h — h 
cancel and so we are left with a simple calculation for the difference: 

h - h = £ - (*-»>' + «-»>', (22) 
2 2P v 

Hence, if we already know either h or h, we can use (22) to compute the other one. An advantage 
of this method, for the case of numerical solutions of the differential equations, is that the step 
size chosen for one mirror will automatically map to the corresponding step on the other mirror. 
Another way to say this is that the ray intersections on the first mirror map to the intersections of 
the same rays on the second mirror, facilitating visualization of the ray paths. 

Let us return now to polar coordinates. Put 

H(f,9)=h(x,y), (23) 

where x = f cos9+5 and y = f sin 9. Using (4) and (5), we get the following differential equations 
fovH: 

r r . n ( x — 5) cos 9 + (y — y) sin 6 (r — f) — 5 cos 9 

Hf = h i cos9 + hySm9 = ± — — ^ = ^ '— (24) 

"o "o 

and 

— (x — x)rsin9 + (y — y)f cos 9 ^fsin^ by 
\ Po = Po 

We can integrate in the radial direction along f (holding 9 constant) to get 



H 9 = —h~r sin 9 + h^r cos 9 = = = — . (25) 

V Pn Pn Pn 



r 



H(f , 0) = g (0 , 0) + f R[U) ~ " - 6 C ° S °- du= H(0, 0) + 1 [ {R{u) -„ )d „- S l^l. (26) 
J Po Po J Po 
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Note that the last term (Sf cos #/P ) corresponds to the "tilt" of the mirror to accomodate the off- 
axis alignment of the two mirrors (i.e., the fact that 5 ^ 0). Furthermore, this tilt term is especially 
simple when written in cartesian coordinates: 



8f cos 9 5(x — 5) 



(27) 



It will be useful to have a polar-coordinate version of h(x, y) as well. Put 

H(r,9) = h(x,y). (28) 
Then from equation (22) and (4)-(5) we find 

g (r, 0) = H(r, e) -*L-( f ~^ + -/)' C ° S W + * . (29) 
2 2F 



4. Mapping and Apodization Relationship 

In this section, we show how to relate the pupil mapping function R(f) to a specified ampli- 
tude apodization function A(f). Here A(f) is the geometric gain factor relating the electric field 
amplitude in the entrance pupil E(r) to that in the exit pupil E(f). We assume that A is real. We 
have 

E{r) = A{r)E{r) (30) 

where, as above, r and f are related by the specified geometric mapping (equations (6) and (7)). 

We now invoke conservation of energy and require that the intensity of light in the entrance 
pupil I(r) be related to that in the exit pupil 1(f) by 

I(f)fdfdBdX = I{r)rdrd9d\ (31) 

where A is wavelength and intensity is in units of energy per unit time per unit area per wavelength 
interval. 

We then have that I = \E\ 2 and I — | ^ | 2 , in appropriate units. Combining (30) and (31) we 

get 

A 2 {f)fdf = rdr (32) 
where r = R(f). From this it follows that 

R(r)R'(r) = A 2 {r)f. (33) 
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In other words, 

(R{f) 2 )' = 2A 2 {f)f (34) 

which can be integrated to yield 



R{f) = ± 



V 



2A 2 (s)sds. (35) 



Equation (35) is the fundamental relation that connects the amplitude apodization function 
A to the ray-mapping function R. In the remainder of this paper, we will develop several results 
that follow from this relation: In Section 5, we give some explicit examples of pupil mapping. In 
Section 6, we discuss extensions to different geometries. In Section 7, we discuss replacing the 
off-axis mirrors with on-axis lenses and derive the corresponding equations for their surfaces. In 
Section 9, we return to pupil-mapping mirrors and explore the off-axis aberrations analytically. 

We end this section by remarking that the relationship between the functions R' and A given 
for the 1-D case in Traub and Vanderbei (2003) (equation (32)) was derived incorrectly. Fortu- 
nately, the correction is simple: just replace A with A 2 in equation (32) and in all subsequent 
equations that depend on this one. 



5. Examples of 2-Mirror Pupil Mapping 

5.1. Constant Mapping Function 

In this subsection, we choose a constant mapping function and explore its implications. We 
will find that this simple case leads to a total of 6 different types of 2-mirror systems, depending 
on the value of the constant. This family includes the familiar afocal Cassegrain, Gregorian, and 
periscope systems, plus variants on these. 

Let us take the intensity mapping function to be 

A 2 {?) = a 2 (36) 

so that the amplitude mapping function is 

A{r) = a (37) 

where we choose the positive root, but allow alpha to take on any real value, positive or negative. 
From equation (35), the ray-mapping function is then 

r = R(f) = ar, (38) 
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where again we choose the positive root, with no loss of generality. 

Inserting this into equations (26) and (29), and requiring that the first mirror be centered at 
the origin, H(0, 0) = 0, we find the following solutions for the first and second mirror surfaces, 
respectively: 

For q^I, these equations describe paraboloidal mirrors, formed by the rotation of a parabola 
about an axis parallel to the z axis. For a — 1, they describe flat mirrors tilted about axes parallel 
to the y axis, i.e., a simple periscope. 

Setting first derivatives of H and H to zero, we find that the axes of the H and H paraboloids 
are both centered at 

^=(^4 <4i) 

The height of the vertex of each paraboloid, either real or projected, is at 

Hm m = — . — - (42) 

2F (a - 1) 



and 



The difference in heights is 



max ~ 2 2P (a-l)' 



^max " ^min = P /2 (44) 

as expected from the property of P as the additional optical path imposed by the two mirrors. 

The second derivatives of H and H are the inverse of twice the paraxial focal lengths, i.e., 
H rr = 1/{2F) and H ff = —1/(2F), where the signs are chosen to show that the light is incident 
on the mirrors from above and below, respectively. Hence, we find these paraxial focal lengths: 

F = < 45 > 

2(at — 1) 

F = -— (46) 

2a-l) V 
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The sum of these is F + F = Pa/ 2, indicating that the parabolas have the same focal point (given 
that their vertices are separated by this amount too), as is expected for a system with parallel light 
incident on it, and exiting from it. 

These properties are summarized in Table 1, and illustrated in Figure 1 which shows ani-2 
cut through the mirrors and edge rays. 



In the context of searching for extrasolar planets, an ideal coronagraph would concentrate the 
incident starlight to a very compact image, free of the bright Airy rings that would otherwise be 
present. The image-plane electric field is the Fourier transform of the pupil-plane electric field, 
and the Fourier transform of a Gaussian is a Gaussian. Therefore, if we can generate a Gaussian 
amplitude distribution, the image from such a beam will have small sidelobes, within the approxi- 
mation that the integral over a finite pupil is approximately the same as the integral over an infinite 
range. (This latter restriction is removed in Sec. 5.3 where the amplitude will be made equal to a 
prolate spheroidal function.) 

For the Gaussian electric field case, we have the amplitude mapping function 



5.2. Gaussian Mapping Function 



A(r) = ce 2a- 



(47) 



which generates the ray-mapping relation 




(48) 



The constant c is determined by the requirement that R(a) 



a and comes out to be 



a 



(49) 



ay/il-e-" 2 /* 11 )' 



Hence, 




(50) 



The equations for the surfaces H and H can be solved numerically. Typical results are shown 
in Figures 2 and 3, where both the side view and end view are given, and selected rays are drawn. 
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5.3. Prolate Spheroidal Wave Functions and Related Apodizations 

On a finite interval, the prolate spheroidal wave function plays a role similar to that of a 
Gaussian on the infinite interval. In particular, for both cases, the product of the width of the 
function and the width of its Fourier transform is minimal, compared to all other functional forms. 

Thus for a coronagraph, a likely optimal design is one in which the electric field across the 
pupil of an imaging lens is distributed as a prolate spheroidal wave function for 1-D optics and 
as a generalized proloated spheroidal wave function for 2-D optics (see Slepian (1965)). Kasdin 
et al. (2004) were the first to notice that these functions play a fundamental role in shaped-pupil 
coronagraphs. But, it turns out that one can get a slightly tighter inner working angle by using 
an apodization function that is specifically designed to achieve the desired contrast in the spec- 
ified dark zone. Optimization models designed in this manner are described in Vanderbei et al. 
(2003b,c). We show in Figure 4 one such function. In Figure 5 we show the point-spread function 
illustrating that the sidelobes are lower than 1CT 10 in intensity, nominally adequate for searching 
for Earth-like planets. 



6. Extensions 

In this section, we briefly consider some generalizations to the basic set up considered so far. 

6.1. Cassegrain vs. Gregorian Design 

As we saw in (34), the apodization function A is related to the square of the transfer function 
R and, in (35), there arose two choices for the square root. Suppose we choose the positive root. 
Figure 2 shows two views of the resulting optical system associated with the apodization function 
shown in Figure 4. The first view is of the (x, z)-plane whereas the second view is of the (y, z)- 
plane. Since the secondary is convex, we refer to this design as a Cassegrain design. 

If we choose the negative square root, then we arrive at a different optical system — one with 
a (rather poor) focal plane between the two mirrors. This optical system is shown in Figure 3. We 
call this design a Gregorian design, because the secondary is concave. 

The choice between a Cassegrain versus a Gregorian design is largely a question of manufac- 
turability; mathematically they perform the same. 
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6.2. Concentric (On-Axis) Designs 

When the second mirror is assumed to be of smaller aperture than the first, it is possible to 
consider an on-axis design. In this case, R must map the annulus defined by the interval [a, a] of 
radii bijectively onto [0, a]. All formulas in the previous sections remain unchanged. 



7. Refractive Elements 

We can replace mirrors M\ and M.2 with coaxial lenses. We require that M.\ and M. 2 are 
piano on their outward-facing surfaces, where the entering and exiting rays are parallel (see Figure 
6). Assume that the lenses have refractive index n, which is constant over the desired band of 
wavelengths. It is easy to show that the mirror figures depend only on radius r. Hence, mirror 
Mi's lower surface is defined by a function h(r) and .M 2 's upper surface is given by a function 
h(f). Repeating the derivation at the beginning of section 3 using the refractive form of Snell's law 
we derive the following analogue of equation (13): 

hr(r) = — "—J- . (51) 

nS + h — h 

And, corresponding to equation (14), we have 

h f .{r) = hr(r). (52) 

Interestingly, Lemma 1 changes to: 

Lemma 2 Q = + h — h is a constant. 

Hence, the analogue of Theorem 1 is more complicated: 

Theorem 2 The shape of the first lens is determined by the following differential equation: 

K = (53) 

^n 2 Ql+{n 2 -l){r-ff 

where f is a known function ofr. Similarly, the shape of the second lens is determined by: 

t — f 

hf = ; (54) 

^n 2 Ql + (n 2 - l)(r - r) 2 

where r is a known function off. 
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Proof. From (51) and Lemma 2, we see that 

K = n TJ-^u w (55) 
Qo + {n - l/n)S 

Since S 2 = (r — f) 2 + (h — h) 2 , we can write the invariant Q as 

Q = -S+ ^S 2 - (r - f ) 2 . (56) 

We can rearrange (56) into a quadratic expression in 5 and then use the formula for the roots of a 
quadratic equation to express S in terms of Q and r — f: 



5 = * j . (57) 

J- 2 

r) ^ 



From this expression, we get that 



Qo + (n - l/n)S = n 2 Q 2 + (n 2 - l)(r - f) 2 . (58) 
Combining (55) with (58), we get the result claimed. □ 



Interestingly, the equivalent of equations (53) and (54) was found by Kreuzer (1969), in a 
patent application for a pair of lenses that could change the Gaussian-distributed beam from a laser 
into a sometimes more useful uniform-intensity beam. Since light is reversible, Kreuzer's goal and 
ours are essentially the same. Both before and after Kreuzer's discovery, there have been many 
papers pursuing similar ends; as a recent example, we mention Hoffnagle and Jefferson (2003), 
who designed and fabricated a pair of convex lenses for this purpose. Our example, shown in 
Figure 6, was derived using equations (47)-(50), with a value c > 0, and gives one positive and 
one negative lens. If we had used c < 0, we would have found both lenses to be positive, and this 
is effectively what Hoffnagle and Jefferson did in their experiment. 



8. Mapping Elliptical Pupils to Circular Ones 

The current baseline design for the Terrestrial Planet Finder space telescope involves an 8 x 
3.5m primary mirror and a square downstream deformable mirror for wavefront control. This 
disparity of shapes introduces the need to reshape the pupil using anamorphic mirrors, which 
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introduces the opportunity also to develop unique pairs of anamorphic mirrors so as to apodize 
the exit pupil as desired for high-contrast imaging. We discuss this design problem here. So, we 
assume that mirror Ai 1 is elliptical having semimajor axes a and b: {(x,y) : (x/a) 2 + (y/b) 2 < 1}. 
We assume that M. 2 is a circular mirror of radius a, that M.\ is uniformly illuminated, and that the 
beam leaving mirror M. 2 is apodized according to a given function A{f) that depends only on the 
radius f. In this setup, the obvious hope would be that the transfer function R from the circular 
case can simply be stretched as needed at each angle 0: 

r = R^ \/a 2 sin 2 9 + b 2 cos 2 9^j (59) 

(the scaling of r inside the function R is chosen so that the result will be between and a). But, 
this transformation implies that the simple angular map 9 = 9 is no longer adequate; we need to 
introduce a 0-transfer function 

9 = 6(0) (60) 

and its inverse 

= 6(0). (61) 
Given this, the inverse transformation for R is easy to find: 

r = ' =ig(r). (62) 



^/a 2 sin 2 9(0) +b 2 cos 2 6(0) 



rdrd9 = 2 ^,~, ,„ T7Z7 ^R{r)R'{f)Q\9)dfd~9 (63) 



From the usual change of variables, we get that 

a 2 b 2 /d 2 
a 2 sin 2 6(0) +b 2 cos 2 6(0) 

and our aim is to have 

rdrd9 = A(f) 2 fdfd9. (64) 

Combining (63) and (64), we see that 

A{fff = R{r)R'{f)^ (65) 
a 



2l2 

a o 



— 6'(0) = 7. (66) 



a 2 sin 2 6(0) + b 2 cos 2 6(0) 

where 7 is a constant determined by a boundary condition to be discussed shortly. Integrating 
differential equation (66), we get that 



= Y tan_1 (j tan ^) • (67) 
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Since = 7r/2 when 9 — ir/2, it follows that 

7 = ab. (68) 

Hence, 

6(0) = tan" 1 (^tanfl) , (69) 

6(0) = tan" 1 Qtan^ , (70) 
and i?(f) is determined by integrating 



A(f) 2 f = R(f)R'{r)^. (71) 

Converting polar functions i?(f) and 6(0) into cartesian equations for x and y as functions of x 
and y, we can finally calculate the mirror shapes using the differential equations in Theorem 1 . 



9. Off -Axis Performance 

In this section we do a careful ray-trace analysis for an off-axis source. Suppose that the 
infinitely remote source lies in the (x, z) -plane and is oriented at an angle <\> from vertical so that 
its unit incidence vector is (— sin 0, 0, cos 0). To keep the analysis manageable, we will only trace 
rays in the (x, z)-plane. In polar coordinates, this is the (r, ^)-plane and = 0. The incoming rays 
can be parametrized by the rr-coordinate at which they hit the first mirror. Fix an x and consider 
such a ray. A ray trace is shown in Figure 7. Throughout this section, anytime a function is a 
function of y or y (among other variables), we will suppress this dependence since these variables 
are zero in this section. Also, for notational convenience we extend the definitions of functions R 
and R to negative values by making the functions odd: 

R(-r) = -R(f) and R(-r) = -R(r). (72) 

Since the rays are entering the system at an angle, the place where the the reflection hits the 
second mirror is displaced, say by Ax, from the point x where an on-axis ray hit this second mirror. 
We begin with this displacement. 

Lemma 3 Ax = S(x, x)<p + o(0). 



Proof. This is exactly Lemma 1 in Traub and Vanderbei (2003). 



□ 



-17- 



A second quantity of importance is the angle at which a light ray reflects off from the second 
mirror. We are interested in how this angle depends on the position x and angle at which it hit 
the first mirror: 

Lemma 4 = -J^- y o(0). 

R'(x) 



Proof. This is essentially Lemma 2 from Traub and Vanderbei (2003). More specifically, it 
follows directly from equations (15) and (16) in the proof of Lemma 2 in Traub and Vanderbei 
(2003) that 

dx/dx A dx 
» ^7^A5 = — 0. (73) 
b(x, x) dx 

Then, dx/dx{x) = R'(x) = 1/R'{x). □ 



Lemmas 3 and 4 tell us that a ray incident on the first mirror at re-coordinate x and angle 
bounces off the second mirror at re-coordinate x(x) + 50 = R(x) + 5 + 50 and at angle 0/ ' R'{x). 
We summarize this by writing 

{x^}* 4 ^ 42 {R(x) + 5 + S(x,x)<f>, -J^j. (74) 

Lemma 4 is both good and bad. It is good because, for apodizations of interest, R'(x) is less 
than one for most re's and hence there is a built-in magnification — off-axis rays come out of the 
system at a steeper angle than they had on entry. It is bad because the rays are no longer parallel 
and therefore cannot be focused to a diffraction-limited image. 

We discuss in detail the issue of nonparallel rays, and how to remedy it, in the next section. 
We end this section with some further discussion of the magnification effect. It turns out to be most 
convenient to consider parametrized over f rather than over r. That is, we are interested in 

(75) 



R'{R{r)) 

R'(f)(f) (76) 
yj r Q 2A{ 8 )*8d8 
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Here, the last equality follows from (35). To arrive at an average magnification, we take an 
intensity-weighted average of 0(f)/0: 

/° 4>{r)A{f) 2 fdr 
magnif = — — ~ — . (78) 

f A(f) 2 fdf 



9.1. Pupil Restoration by System Reversal 

Guyon (2003) addresses this off-axis defocus question. He recommends placing a focusing 
element L\ after M 2 followed by a star occulter in the image plane, a beam-recollimating ele- 
ment £ 2 , and finally a pupil mapping system identical to the first two mirrors but set up exactly 
backwards. Let's refer to these last two mirrors as M3 and Ma- If the focusing and recollimating 
elements {C\ and £ 2 ) are assumed to be ideal lenses having a common focal length, say /, and if 
the distances from M2 to L\ and from C 2 to M3 are both also chosen to be /, then the pupil at 
mirror TV! 3 is a reimaging of the pupil at M2 and so a ray hitting M 2 at, say, position x and angle 
will hit M3 at position 25 — x and angle — 0. We summarize this as 



X,(p 



[25 (79) 



(see Figure 8). 



Figure 9 shows a version of the full system in which the reversed system folds back to the left. 
The following theorem describes how an off-axis ray propagates from mirror M3 to Ma- 

Theorem 3 For the system shown in Figure 9, a ray propagates from mirror A4 3 to mirror M. 4 as 
follows: 



|x,0} 



_ S) + S(x, 1)0, J ) (80) 



where x denotes the point on mirror M.^ corresponding to an on-axis ray impinging on mirror M.3 
at x. 



Proof. Because the reversed pupil mapping system (M.3, Ma) is identical to the first one, we start 
by inverting the operation given by (74). This inversion describes a ray propagating backwards 
through mirrors M2 and Mi. 

From (74), we have that 
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Hence, 

= R'(x)4>. (82) 

We need to express R\x) in tenns of x (and perhaps 0). To this end, we use equations (5), (6), and 
(7) to write 

x = f + 5 = R(r) + 5 = R(R(f)) + 5 = R(R(x - 5)) + 5. (83) 
Differentiating with respect to x, we get 

1 = R!{R{x-5))R\x-5). (84) 

Hence, 

R'(x)R'(x -5) = 1 (85) 

and so we get that 

* = mhsy (86) 

Now, we remind the reader that our propagations represent just the constant and linear terms in an 
expansion in the small angular parameters <fi and 0. Since these angles are small, it follows that 
x — x is also small. We retain terms that are linear in these small parameters but we drop higher 
order terms. Hence, since the right-hand side in (86) already is small, we can simply replace 
R'(x-S) with R'(x-S). 

From (74), we also have that 

I = R(x) + S + S{x,x)(j). (87) 

We need to solve this for x. From (8), we see that 

x = R(x-6 - S(x,x)<f)) (88) 

w R(x-5)-R'(x-5)S(x,x)(/) (89) 

= R(x-5)-S(x,x)4> (90) 

w R(x-5) -S(x,x)<j>, (91) 

where x denotes the point on mirror M i corresponding to an on-axis ray impinging on mirror M.2 
at x. 

From (86) and (91), we see that backwards propagation through the M1-M2 system is given 

by 

[i,}} M ^L(x-5)-S(x,x)l }• (92) 
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To describe propagation through the M.3-M.4 system, we have to flip this system about a horizontal 
axis, apply the above transformation, and then flip back. The flip operation leaves horizontal 
coordinates unchanged but negates angles. Hence, we get 

— (93) 

_ S) + S(x, 1)0, - t 7\ [ (94) 
it'(x — 0) J 

_ 5) + 1)0, / I . (95) 
R'(x — 0) J 

This completes the proof. □ 



We are now ready to combine the above ray propagation results to describe propagation 
through the entire system: 

Theorem 4 For the system shown in Figure 9, a ray entering the system at position x and angle 

n S(x,x) , , , — 

<p, exits the system at position —x — 2— ana at angle 



*W ' " l-§^S(x,x)<p 



Proof. We can compose the maps given by (74), (79), and (80) to get that 

{x,<p} — (it>(x) + <5 + ,S(x,x)0, -J-) (96) 
I R'{x) J 

-R(x) + 5-S(x,x)<f>, (97) 

R'(x) ) 

R (-R(x) - S(x, 5)0) - gfc^ __ V 98 ) 



R'( x ) R'(x)R' (-R(x) - S(x, x)(f?j 

Since R(R(x)) = x, we can differentiate this identity to derive simple identities relating the deriva- 
tives of R(R(x)) to the derivatives of R(x). From these identities, we easily see that 

R(R(x)+S(x,x)(f^ = R(R(x)) +R'(R(x))S(x,Z)<f) + o(<f)) (99) 

= x + 2^>0 + o(0) (100) 
R'(x) 
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and 




R'(R(x)) + R"(R(x))S(x, x)<j> + o(0) 



(101) 



1 R"(x) 



-S(x,x)(f) + o((f)) 



(102) 



R'(x) R'(x) 



Using the fact that R is an odd function, and therefore that R' is even, we substitute (100) and 



A comparison of the exiting ray angles from the 4-mirror corrected system (Theorem 4) com- 
pared to those from the 2-mirror uncorrected system (Lemma 2) shows that the relative scatter of 
ray angles will be smaller in the corrected system by a factor which depends on the details of the 
function R, but which in general will be roughly a factor of a few times the off-axis angle 0; for 
typical planet- searching angles of a few arcseconds, this ratio could easily be less than 1CT 3 , which 
is a significant reduction in scatter of ray angles. (Specific estimates will be given in a future paper 
with numerical results.) 

Figure 10 shows a version of the full system in which the reversed pupil mapping folds off 
to the right. This case is just like the previous one except that there is effectively a symmetry 
reflection about the vertical axis x = 5 that must be applied before entering and after leaving the 
second system. Flipping about this vertical axis has the effect given by (79). 

Theorem 5 For the system shown in Figure 10, a ray entering the system at position x and angle 

(j), exits the system at position 25 — x — — and at angle — — . 

R '( x ) l-§^S(x,x)<p 

Proof. We need to compose the maps given by (74), (79), (79), (80), and (79). Of course, applying 
(79) twice in a row simply undoes the effect and so we can simply form the composition of (74), 



(102) into (98) to get the results claimed in the theorem. 



□ 
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(80), and (79): 



{x,<P} — [R(x)+6 + S(x,x)<j>, -~|-} (103) 

R (r(x) + S(x, 5)0) + ^ . 1 (104) 

V J R'{x) R>(x)Ri (R(x) + S{x,x)(j)) J 

x + 2^0, * > (105) 

25 -x -2^0, * }. (106) 



□ 



Since the estimated aberrations a; and are the same in Theorems 4 and 5 (cf. Figures 9 and 10), 
we see that these correcting schemes are equivalent in terms of their ability to reduce aberrations. 

Finally, we remark that our analysis has ignored the beam walk that would be introduced by 
the fact that mirrors _M 2 and M s are not everywhere a distance / from the corresponding lens. 



10. Summary 

We derived equations for the shapes of mirrors and lenses capable of converting a uniform- 
intensity beam into a shaped-intensity beam (the pupil-mapping process). We gave analytical 
estimates of the aberrations of a 2-mirror system, and the improved case of an aberration-corrected 
4-mirror system. We applied the results to several examples, including the familiar Cassegrain- 
Gregorian telescope designs, as well as beam shapes given by Gaussian and prolate spheroidal 
functions. The general equations will allow the design of many types of optical systems, but in 
particular should be helpful in designing optics for telescopic searches for extrasolar planets. 
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Fig. 1 . — This figure shows the family of 6 types of 2-mirror systems that can be obtained from the 
requirement that an input beam with a fiat wavefront be mapped to an output beam also with a flat 
wavefront but with a uniform relative intensity of a 2 . The short dashed lines show the large-scale 
surface of revolution of each paraboloid. The heavy solid line shows the actual mirror surface. 
The long dashed line shows the common axis of revolution of the surfaces (outside the panel in 
the upper 3 panels). In each case the vertices have the same vertical separation (Pq/2 = 3.5) 
and horizontal separation (5 = 1.1), and the diameter of the larger mirror is 1. Input is from the 
upper left, and output is to the lower right, (a) Cassegrain with a > 1; confocal paraboloids, 
with common focus outside the panel (above and right), (b) Gregorian with a < — 1; confocal 
paraboloids with common focus between the mirrors and on the axis of each mirror, as shown, (c) 
Symmetric Cassegrain (periscope), a — 1; two flat mirrors, common axis and focus at infinity, (d) 
Symmetric Gregorian, a — — 1; confocal paraboloids, with focal points geometrically centered as 
shown, (e) Inverted Cassegrain, < a < 1; confocal paraboloids with common focus outside 
panel, exactly the same as (a) for reversed direction of light beam and a — > I /a. (f) Inverted 
Gregorian, — 1 < a < 0; confocal paraboloids with focus between the mirrors, same as (b) for 
reversed light and inverse a. 
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Fig. 2. — A Cassegrain design. Parallel light rays come down from above, reflect off the bottom 
mirror, bounce up to the top mirror, and then exit downward as a parallel bundle with a concentra- 
tion of rays in the center of the bundle and thinning out toward the edges — that is, the exit bundle 
is apodized, but with no loss of light. 



Fig. 3. — A Gregorian design. Similar to Figure 2, but since both mirrors are concave there is an 
intermediate focus, albeit a poor focus with a large halo. Nevertheless the rays emerge parallel, 
just as in Figure 2. An advantage here might be in manufacturability. 
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Fig. 4. — An energy conserving apodization providing contrast of 1CT 10 from AX/D to 60A/.D. 
As explained in Section 9 (and in Traub and Vanderbei (2003)), the unitless angle AX/D only 
corresponds to a sky angle in the case where the apodization function is identically one. For non- 
trivial apodizations, such as this one, the off-axis rays get magnified by a factor related to A{r). 
Hence, the intensity-weighted average magnification given by (78) (using (77)) evaluates to 2.16 
and therefore AX/D corresponds to (4/2.16)A/D = 1.85X/D as an angle on the sky. 
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Fig. 5. — The on-axis point spread function for the apodization shown in Figure 4. High contrast 
for an on-axis point source occurs at AX/ D. But, as explained in the previous caption, an off-axis 
source such as a planet having, say, an angle of 2X/D in the sky will appear mostly at 2 x 2.16 or 
4.32A/.D and is therefore detectable in principle. 



-30- 




Fig. 6. — Pupil mapping via a pair of properly figured lenses. This Galilean arrangement, with 
one convex and one concave lens, is the result of using c > in the Gaussian mapping function in 
Eqns. (47)-(49). If we had used c < the resulting lenses would both be convex, and the beam 
would have had a waist (an approximate focus) between the lenses. 
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Fig. 7. — Ray-trace of an off-axis source as it passes through a two-mirror pupil mapper. 
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Fig. 8. — Ray-trace of an off-axis source from M. 2 to for a 4-mirror system. Note that by 
placing the lenses halfway between their corresponding mirrors and the image plane and choosing 
their focal lengths to be this distance, we get that the image of mirror M. 2 is exactly at A4 3 . Hence, 
each ray maps to a position reflected through the optical axis and comes out with the opposite 
direction. 
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Fig. 9. — Ray-trace for the full 4-mirror system: two-mirror pupil mapper, lens system, and re- 
versed two-mirror pupil mapper. An occulting spot is shown at the center of the image plane to 
block on-axis starlight. 




Fig. 10. — Ray-trace for the full 4-mirror system in which the reversed system continues to the 
right rather than folding back to the left. 
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Quantity (units) 


a < -1 


a = — 1 


-1 < a < 


< a < 1 


a = 1 


a > 1 


^min = 5^max ($) 
H m [ n (2^) 

sign of F 
sign of F 
type of system 


1 to 1/2 1/2 1/2 to Oto-00 ±00 ootol 
-1 to -1/2 -1/2 -1/2 to Otooo ±00 -ooto-1 
+ + + 00 + 
+ + + + 00 
Greg. eq. Greg. inv. Greg. Cass. eq. Cass. inv. Cass. 



Table 1: Characteristics of 2-mirror afocal systems, generated from the amplitude mapping func- 
tion A(f) = a. Here, inv. means inverted, eq. means equal and refers to relative mirror sizes, 
Greg, means Gregorian, and Cass, means Cassegrain. Note that the "equal Cass." is a plane- 
mirror periscope. 



